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Abstract 

In this paper we analyze declarative deterministic and non-deterministic semantics for 
active rules. In particular we consider several (partial) stable model semantics, previously 
defined for deductive rules, such as well-founded, max deterministic, unique total stable 
model, total stable model, and maximal stable model semantics. The semantics of an active 
program AP is given by first rewriting it into a deductive program CP, then computing 
a model M defining the declarative semantics of CP and, finally, applying 'consistent' 
updates contained in M to the source database. The framework we propose permits a 
natural integration of deductive and active rules and can also be applied to queries with 
function symbols or to queries over infinite databases. 



1 Introduction 

Active databases is an emerging technology combining techniques from databases, 
expert systems and artificial intelligence. The main peculiarity of this technology 
is the support for automatic 'triggering' of rules in response to events. Automatic 
triggering of rules can be useful in different areas such as integrity constraint main- 
tenance, update of materialized views, knowledge bases and expert systems etc. 
dWidom fc Ueri, 1996| ). 

Active rules follow the so called Event-Condition-Action (ECA) paradigm; rules 
autonomously react to events occurring on the data, by evaluating a data dependent 
condition, and by executing a reaction whenever the condition is true. Active rules 
consist of three parts: Event (which causes the rule to be triggered), Condition 
(which is checked when the rule is triggered) and Action (which is executed when 
the rule is triggered and the condition is true). Thus, the semantics of a single 
active rule is that the rule reacts to a given event, tests a condition, and performs a 
given action. However, understanding the behavior of active rules, especially in the 
case of rules which interact with one another, is very difficult, and often the actions 
performed are not the expected ones. A very important issue in active databases 
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is the development of tools which help in the design of programs with clear and 
intuitive semantics ( |Ceri fc Fraternali, 1997| ). 

The semantics of active rules are usually given in terms of execution models, 
which specify how and when rules will be applied. However, execution models are 
not completely satisfactory since their behavior is not always clear and could result 
in nontcrminating computations. This is shown by the following example. 

Example 1 

Consider program AP where + mgr(X, P, D) (resp. — mgr(X, P, D)) in the head of the 
rule means that, if the body is true, the atom mgr(X,P,D) is inserted into (resp. 
deleted from) the databases. 



-mgr(X,P,D) «- -proj(P), mgr(X,P,D)- 

+mgr(X,P,D) «- -mgr(X,P,D), ->dif f JHgr(X, D> 

-mgr(X,P,D) «- +mgr(X,P,D), -proj(P)- 

dif f _mgr(X, D) <- mgr(X', P, D), X' ^ X- 
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Assume that the database D consists of the tuples proj(p) and mgr(x,p, d) and 
that the initial update is S = {— proj(p)}. 1 The meaning of the first rule is that if 
a project P is deleted (atom — pro j (P) in the body of the rule) all managers of the 
project must be deleted. The second rule states that if a manager X of project P 
in the department D is deleted (atom — mgr(X,P,D) in the body) and there is not a 
second manager in the department D (literal -idif f _mgr(X, D)), then the manager X 
must be re-inserted (atom +mgr(X, P, D) in the head). The meaning of the third rule 
is that if a manager X of project P in the department D is inserted (atom +mgr (X, P, D) 
in the body) and the project P does not exist (literal -iproj(P)), then the manager 
X must be deleted (atom — mgr(X,P,D) in the head). Therefore, the rules define a 
sort of constraints on the insertion and deletion of tuples to guarantee the integrity 
of data. 

The procedural evaluation of this active program applies first rule t\ and then, 
alternately rules ri and r^, which insert and delete the atom mgr(x,p, d) an infinite 
number of times. □ 

The evaluation of the active rules in the example above generates an infinite loop. 
Infinite loops can be avoided by a careful writing of rules. However, generally, it 
is very difficult to determine whether the procedural evaluation of a set of active 
results in a terminating computation. A second problem when dealing with active 
programs is that the future state of the database depends on the order in which 
active rules are fired — it seems that, under procedural semantics, the behavior of 
active rules is not easy to understand. Thus, two main properties have been identi- 
fied for active rules: termination, which guarantees that the computation terminates 
in a finite number of steps, and confluence, which guarantees that the execution 
of rules always gives a unique outcome. 2 Several techniques have been designed 

1 <5 consists of a set of updates requested by users or transactions. 

2 A third important property is observable determinism. This property is not relevant in our 
framework since we consider only the insertion and deletion of tuples as actions. 
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for checking termination and confluence properties. These techniques are mainly 
based on the definition of 'compile-time' sufficient conditions. However, there are 
terminating and confluent programs which cannot be identified by these techniques. 

Different solutions to these problems have been proposed in QBidoit fe Maabout, 1997] ) , 
where a declarative semantics is associated to active rules, and in QZaniolo, 1995| 
|Lausen et ai., 1998| ), where active rules are modeled by means of deductive rules 
with an attribute denoting the state of the computation. 

The solution proposed by Bidoit and Maabout QBidoit fe Maabout, 1997] ) is based 
on the computation of the well-founded semantics of a Datalog^ program derived 
from the rewriting of the active program. The merits of this approach are that 
the semantics of programs is well-defined and the computation guarantees both 
termination and confluence in the general case. 

However, the approach proposed in QBidoit fc Maabout, 1997| is not completely 
satisfactory since in many cases it does not capture the intuitive meaning of the 
active program. 

Example 2 

Consider the following program and the initial update S = {+conf irm(x, d)}. 
+mgr(X,D) <- +conf irm(X,D)- 

-mgr(X,D) <- mgr(X,D), ^ + mgr(X,D), +conf irm(Y, D)- 

where the literal mgr(X, D) means that X is the manager of department D whereas the 
(update) literal -i +mgr(X,D) means that the atom mgr(X,D) is not inserted in the 
database, i.e. the event associated with the insertion of the tuple does not happen. 
Since the atom +mgr(x, d) is inserted in the database, the second rule instantiated 
with X = x and D = d will not be fired; therefore the atom mgr(x, d) is not removed 
from the database. □ 

However, the semantics proposed in QBidoit fc Maabout, 1 997) is not able to de- 
duce this fact and it concludes that mgr(x, d) is undefined in the updated database. 
There are two problems with this semantics: the first is in the rewriting of active 
rules into deductive ones and the second is that the well-founded semantics in many 
cases is not satisfactory. 

To overcome these problems we revise Bidoit and Maabout 's technique by defin- 
ing a different rewriting method and propose in addition to the well-founded, other 
declarative semantics. 

The rest of the paper is as follows. In Section 2 we present preliminary definitions 
on active rules and Datalog. In Section 3 we present desirable properties which will 
be used to compare different semantics. In Section 4 we introduce deterministic 
and non-deterministic semantics for active rules. Then, in Section 5 we analyze the 
complexity of the semantics presented in the previous section. Finally, in Section 6 
we compare declarative semantics, such as that proposed in this paper, with tools 
for checking termination and confluence, proposed for procedural semantics. We 
also discuss termination and confluence in the presence of functions symbols. 
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2 Preliminary definitions 

In this section we introduce preliminary definitions and results on deductive and 
active rules. We also recall different versions of stable model semantics for DATALDG 
with negation. 

2.1 Active and deductive rules 

We assume finite countable sets of constants, variables and predicate symbols. A 
(simple) term is either a constant or a variable. A (standard) atom is of the form 
p(h, ■ ■ •, £„) where p is a predicate symbol and t\, • ■ -,t n are terms. A literal is an 
atom A or its negation ->A. Predicate symbols can be either base (EDB) or derived 
(IDB). Atoms and literals are either base or derived accordingly to their predicate 
symbols. An update atom (resp. literal) is of the form +A or — A where A is a base 
atom (resp. literal). 
A rule is of the form 

A <— Bi, ■ ■ -, B n , Ci, • • •, C m 

where A is an atom, Z?i, • ■ -,B n are update literals and Ci, • • •, C m are standard 
literals. 

Rules can be either active or deductive. Active rules have an update atom in the 
head whereas deductive rules have a standard atom in the head. A deductive rule is 
called DATALOG" rule if the body does not contain update literals. A rule is said to 
be positive if it is negation free. Positive DATALOG^ rules are called DATALDG rules. 

Let r be a rule; H(r) and B(r) represent, respectively, the head and the body 
of r. A ground DATALOG rule with no goals is called a fact. A program is a set of 
rules. A program AP is called active if it contains at least one active rule otherwise 
it is called deductive. A deductive program is called DATALOG^ if does not contain 
update literals. 

EDB predicate symbols form a relational database scheme S, thus they are also 
seen as relation symbols. The set of all databases on a given schema<S is denoted by 
D. A database D = (D + , D) onS is a set of pairs of finite relations {D + {p) 1 D(p)) 
on a countable domain U (database domain), one for each p inS such that D + (p)(l 
D(p) = 0. D + denotes the set of true facts in D whereas D is the set of unknown 
facts in D. Moreover, we denote with D~ the set of false facts in D, that is the set 
of facts which are not in the database. Observe that databases may contain, besides 
true facts, facts which are unknown. The reason for also considering unknown facts 
is to capture situations where the insertion or deletion of a tuple is undefined 
(for instance, under procedural evaluation, a tuple could be inserted and deleted 
indefinitely). A database D on S is said to be total if D = 0, i.e. there are no 
unknown facts in D. 

An update program UP is a pair (5, AP), where 5 is a set of update facts and AP 
is an active program. The set 5 contains the input update set which is used to fire 
the active rules. An update program UP defines a mapping from D to D. Thus, the 
application of an update program UP to a database D\ on<S, denoted UP(Di), gives 
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a new database D 2 on S. Our model is based on durable changes, first proposed in 
( |Zaniolo~ 995), where the input update set 5 is used to activate the rules in AP 
and only the updates derived from AP which will belong to the final update set 
are used to trigger again active rules. Thus, an update program UP, on a database 
schemed, is a recursive (i.e., computable) function from D to D. In addition to 
being computable, every update program UP is required to be C-generic, i.e. for 
any D onS and for any isomorphism p on U — C , UP(p(D)) = p(UP(D)), where 
the domain C denotes the set of constants in the program. Informally speaking, the 
output of a program does not depend on the internal representation of the constants 
in the database not appearing in the program. 

Given a database D = (D + , D) on S and an active program AP, AP d denotes 
the program obtained from AP by adding for each relation p and for each tuple t 
in D + (p) a fact p(t) and for each tuple t in D{p) a rule of the form p(t) <— ~^p(t). 
3 The set of constants in AP d defines the Herbrand universe H_ap d of AP d ■ The 
set of ground atoms built by using the constant in Hj$> D defines the Herbrand base 
Bap d °f ■APd The ground instantiation of AP d is denoted by ground(AP 1 d) ■ 



2.2 Semantics of DATALOG" 

In this section we recall the definition of (partial) stable model semantics for 
DATALOG^ programs. Partial stable semantics also applies to general logic programs, 
i.e. to programs where terms can be both simple or complex (a complex term is of 
the form f(h,- ■ ■, t n ) where / is a function symbol and ti, ■ ■ -,t n are terms). 4 We 
first recall the definition of partial stable model and next consider a restricted class 
of stable models called deterministic models. 



P-Stable Models 

Given a literal A, ->->A denotes A. Let / be a set of ground literals; then -J 
denotes the set {—1^4 1 ^4 e /}, and I + (resp., I~) denotes the set of all literals 
(resp., negated atoms) in /. Given a DATALOG^ program CP and a database D, we 
denote with / = Bcp D — (J+ U the set of facts in the Herbrand base which 

are undefined in the interpretation /. / is a (partial) interpretation of CPd if it is 
consistent, i.e. I + n = 0. Moreover, if I + U = Bcp D , the interpretation / 
is called total. 

Letting / be an interpretation for a program CPr>, then the truth value of an 
atom A 6 Bcp d with respect to interpretation /, denoted by 1(A), is equal to (i) 
true if A £ I, (ii) false if -iA e / and undefined otherwise, i.e. A £ I. We assume 
the linear order false < undefined < true and ^undefined = undefined. 

A rule A <— A\,- ■ -,A m in ground(CPo) is satisfied w.r.t. an interpretation I 

3 A rule of the form p(t) <— ^p(t) states that p(t) must be undefined since by assuming p(t) false 
we derive p(t) true and under the assumption that p(t) is true p(t) cannot be derived from the 
program since the body of the rule is false. 

4 For general logic programs we also assume a finite countable set of function symbols. 
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if 1(A) > min{I(Ai) | 1 < i < m}. An interpretation / is a model if all rules 
in ground(CP) are satisfied. The semantics of logic programs is given in terms of 
partial stable model semantics ( Gclf ond fc Lifschitz, 1988] ) which we briefly recall 
below. 

An interpretation M of CP d is a P-stable (partial stable) model if it is the 
minimal model of the positive program gl(CP 1 M) obtained from ground(CP) by 
replacing each negated body literal -*A with the complement of the truth value of 
A w.r.t. M. 

Example 3 

Consider the program 

a <— -ib- 
b <- -.d- 
c <— a, b- 

This program has three P-stable models: Mi = { } (all atoms are undefined), Mi = 
{a, ^c} and M 3 = {^a, b, ^c}. □ 

A P-stable model M of CPd is (i) T-stable (total stable) if it is a total in- 
terpretation of CPd, and (ii) well-founded if it is the intersection of all P-stable 
models of CP. For instance, in the above example, Mi is well-founded whereas 
Mi and M3 axe T-stable. T-stable model was the first notion of stable model 
and was defined in ( |Gelfond fe Lifschitz, 19881 ); the existence of a T-stable model 
for any program is not guaranteed. The well-founded model was introduced in 
jVan Gelder et al, 1991j and is obviously unique. It is well known that a well- 
founded model exists for any program; therefore, the existence of at least one P- 
stable model is guaranteed as well. 

Stable model semantics introduces a sort of non-determinism in the sense that 
programs may have more than one "intended" model ( |Gelfond fc Lifschitz, 1988| ). A 
number of interesting subclasses of P-stable model have been recognized as possible 
"intended" models of a logic program. 

In particular, the M-stable (maximal stable) models are those P-stable models 
that are not contained in any other P-stable model. Moreover, L-stable (least unde- 
fined stable) models are the M-stable models which leave a minimal set of elements 
of the Herbrand base undefined, and coincide with traditional total stable (T-stable) 
models whenever the set of undefined elements is empty. 

Example 4 

Consider the following program: 
a- 



b «— ^c- 

c «— ^b- 

P <- b, -.p- 

d <— a, -ip, ^e- 

e *— a, -ip, ^d- 

q <- -.d, -.q- 
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The P-stable models are: Mi = {a}, M 2 = {a,b, ->c}, M 3 = {a,^b, c, ^p}, 
M4 = {a, -<b, c, ->p, ->d, e}, Ms = {a, -16, c, -ip, d, ->e, Mi is the well- 

founded model; M2, M4 and M5 are the M- stable models; M5 is also both L-stable 
and T-stable. 

Now consider the above program without the first rule defining the predicate a. 
Then the P-stable models are: Mi — {->a, -id, ->e}, M2 — ~>d, ->e, b, ->c}, 

M3 = {-1 a, -id, -ie, -ib, c, ^p}. Mi is the well-founded model; M 2 and M 3 are the 
M-stable models; M3 is also L-stable but not T-stable. (The atom q is undefined in 
all P-stable models.) □ 

The non-deterministic version of the stable model semantics is based on the (non- 
deterministic) selection of a stable model ( |Greco et al, 1 995). Non-determinism 
offers a solution to overcome the limitations in expressive power of deterministic 
languages ( |Abiteboul fc Vianu, 1991||Abiteboul et al, 1990). For instance, it seems 
to be the only way to capture all polynomial-time queries without requiring the defi- 
nition of an ordering on the domains, thus renouncing the data independence prin- 
ciple | |Abiteboul et al., 1994| ). The problem with stable model semantics is that the 
expressive power can blow up without control, so that polynomial-time resolution 
is no longer guaranteed. In fact, finding a T-stable model may require exponen- 
tial time; worse, deciding whether a program has a T-stable model is A/T'-complete 
| |Marek fe Truszczynski, 1991| ). Thus, it is possible that polynomial-time queries are 
computed in exponential-time, that is, it is possible to get untoward exponential 
time resolution. 



Deterministic P-Stable Models 

In this section we present the concepts of deterministic P-stable models and 
of max-deterministic P-stable model. We point out that this issue is extensively 
analyzed in ( |Sacca fc Zaniolo, 1997| ) also for the case of infinite universe and it is 
further investigated in relation to the notion of non-determinism. The notion of 
max-deterministic model was first introduced in ( |Sacca fc Zaniolo, 1990| ) and its 
relevance was also discussed in ( |Greco fc Sacca, 1997| ). 

Let CP be a DATALDG^ program, D be a database on S and M be a P-stable 
model of CPd- M is deterministic if for every other P-stable model N of CPd, 
M and N are not contradictory, i.e. M U N is an interpretation. As proven in 
acca fc Zaniolo, 1997| ), two deterministic P-stable models are only an expression 
of assorted degrees of undefinedness inasmuch as there exists a P-stable model 
which includes both models. 

The well-founded model is obviously a deterministic model — actually, as it 
is the intersection of all P-stable models, it is the minimum deterministic model. 
The family of deterministic models, denoted by DM, has an additional property: 
there exists a maximum element in the family, the max-deterministic model, which 
includes all other deterministic models and, therefore, can resolve all the differences 
among them. 
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Fact 1 

\Sacca fc Zaniolo, 199% For every DATALOG -1 program CP, {DM, C ) is a complete 
lattice such that the bottom is the well founded model and the top is the max- 
dctcrministic model. □ 

Example 5 

Consider the following program CP 

a <— -ib- 

b «— -ia- 

c «— a- 

c <- b- 

c <— -id- 

d <— -ic- 

The program £P has four P-stable models: Mi = { }, M^ = {c, -id}, M3 = 
{a, -ifo, c, -id} and M4 = {-ia, b, c, -id}. Mi is well-founded, M3 and M4 are Te- 
stable and M2 is max-deterministic. □ 

Observe that deterministic P-stable models trade off minimal undefinedness to 
achieve determinism, but in different degrees. At the bottom, we find the well- 
founded model, which ensures better computability at the expense of more unde- 
finedness. At the top, we find the max-deterministic model, whose clear semantic 
advantages are counterbalanced by computational drawbacks, as shown below. 



3 Desirable properties of active programs 

Several desirable properties for active rules have been introduced in the literature. 
The most important properties are termination and confluence. 

Termination is the property which guarantees that the execution of active pro- 
grams terminates in a finite number of steps. It is a crucial problem in active 
databases since rules may trigger each other recursively, and consequently, the non- 
termination of active rules execution is a constant threat. 

Confluence is the property that the execution of active programs produces a final 
state (updated database) independent of the order of execution of (not prioritized) 
rules. In general, confluence is a desirable feature since the behavior of rules with 
a unique final state can be more easily understood and because most applications 
require a unique final state (e.g. materialized views applications). 

In this paper we consider a different framework based on the use of declara- 
tive semantics. All the semantics considered guarantee termination of the update 
programs, since they are based on finite domains and stable model semantics. Fur- 
thermore, the first group of proposed semantics also guarantees confluence, since 
they identify a unique particular stable model as the intended model for the update 
programs. We also propose semantics that do not guarantee confluence, since they 
nondeterministically choose a model among a set of intended models. 

Complexity and expressivity are also important interrelated properties. It is an 
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open issue whether semantics with high complexity (and expressivity) are prefer- 
able. Here we consider declarative semantics which have low complexity and ex- 
pressivity w.r.t. procedural semantics proposed in the literature. 

For declarative semantics, several other desirable properties have been proposed 
pSidoit fc Maabout, 1997| ). A first requirement is that any semantics must be founded, 
that is, the set of (positive and undefined) atoms in the model must be derivable 
from the program. A second requirement is that the set of updates to be applied 
to a given database must be consistent. 



Definition 1 

A set of update literals M is said to be 

1. conflict free if it does not contain two update facts of the form +A and — A; 

2. consistent 5 if for each atom +A G M (resp. —A £ M) it must be the case 
that -. - Ae M (resp. -> + Ae M). □ 

Finally, we consider a property of knowledge ordering among databases (i.e. set 
of atoms which can be either true or undefined) . 

Definition 2 

Given two databases D± and D2, we say that D2 is more informative than D\ or, 
equivalently, that D\ is not more informative than D2 (Di ^ D2) if D2 C D\. □ 

This property is important when we are interested in databases having maximal 
sets of true and false facts, and, therefore, our transformations should minimize the 
set of unknown facts. 



4 Declarative Semantics for Update Programs 

As discussed in the Introduction, the semantics proposed in ( |Bidoit fc Maab out, 1997| ) 
is not able to capture the intuitive meaning of all programs. Thus, in this paper we 
propose different semantics for active rules. More specifically, we consider a differ- 
ent rewriting of active rules into deductive ones and analyze several (partial) stable 
model semantics. Let us start by introducing how sets of updates are applied to 
databases. 

Definition 3 

Let D = (D + , D) be a database and let M be a consistent set of update literals. 
Then, the application of M to D, denoted M(D), gives a new database D\ defined 
as follows: 

1. p g if +p g M or (p g D + and -. - p g M); 

2. p g D\ if one of the following conditions is true 

(a) p g D and +p, -p g M + ; 

(b) p g D + and -p e ~M; 



5 This property has been called weii founded in jBidoit fc Maabout, 1997) . 
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Algorithm 1 

Rewrite update programs. 

Input Update program UP = (8,AP); 

Output DATALOG" program st{UP); 

Method 

1. insert into st(UP) the deductive rules in AP; 

2. For each active rule +a(t) <— B (resp., — a(t) <— B) in AP insert into st(UP) the 
rule +a(t) <— B , -ick_a(t) (resp., — a{t) <— B , -^ck-a(t)) where cfc_a is a new predicate 
symbol; 

3. For each rule r in st{UP) replace each update atom +p(t) (resp. — p(t)) in the body 
with +p(t) V p'(t) (resp., — p{t) V p"(t)), where p' and p" are new IDB predicate 
symbols; 

4. For each new predicate symbol cfc_a with arity n, introduced in Step 2, add a rule 
ck.a(X!,-- -,X„) <- +a(Xi,- • -, X n ), -a(Xi, ■ ■ -,X n ); 

5. For each update atom +p(i) (resp. — p(t)) £ <5 insert into st(UP) the fact p'(i) (resp., 
p"(t)) (introduced in the body of rules in Step 3); 

6. Rewrite the program in clausal form by eliminating disjunctions; 

7. Return st(UP) where every update atom +a(t) (resp., —a(t)) is interpreted as a stan- 
dard atom, i.e., +o and —a are new standard predicate symbols. □ 

Fig. 1. Rewriting of update programs 



(c) p £ D~ and +p £ M. □ 

Thus, an atom p is undefined in the new database D\ if (1) it was undefined in 
the old database D and there is no evidence about its insertion or deletion, or (2) 
it was true (resp. false) in D and its deletion (resp. insertion) is undefined in M. 

The algorithm reported in Figure 1 rewrites an update program UP into a de- 
ductive program denoted as stiUP). 

Essentially, the algorithm rewrites update atoms taking into account information 
contained in the input update S, and adds 'temporary' rules which are used to 
guarantee the consistency of the model, i.e. any model containing an atom of the 
form +a(t) must also contain the literal -> — a{t). In particular, for each update 
+a(t) (resp. — a(t)) in S a fact a'(t) (resp. a"(t)) is added to the program (Step 5) 
and every update atom +a(u) (resp. — a{u)) in the body of a rule is replaced by 
a disjunction +a(u) V a'(u) (resp., —a(u) V a"(u)) which takes into account that 
the insertion (resp. deletion) of an atom can be derived from the program or it is 
contained in the input update set S (Step 3). Moreover, programs are subsequently 
rewritten in standard form by eliminating disjunctions. 

The derived program contains update literals which are interpreted as standard 
ones, i.e. an update atom +p(t) will be interpreted, in the rewritten program, 
as a standard atom with predicate symbol +p. The declarative semantics of the 
rewritten deductive program st{UP) gives a set of literals also containing elements 
with predicate symbols of the form +p and — p; these atoms will be interpreted as 
insertions and deletions to be applied to the source database. The following example 
clarifies the rewriting of update programs. 
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Example 6 

Consider the update program UP = (S,AP) of Example 1 where S = {— proj(p)} 
and AP consists of the following rules 

-mgr(X,P,D) <- -proj(P), mgr(X,P,D)- 
+mgr(X,P,D) <- -mgr(X,P,D), -dif f _mgr(X, D)- 
-mgr(X,P,D) <- +mgr(X,P,D), -proj(P)- 
dif f jngr(X, D) <- mgr(X', P, D), X' ^ X- 

The deductive program derived from the application of Algorithm 1 before rewriting 
the disjunctions (Step 6) is 

-mgr(X,P,D) <- (-proj(P) Vproj"(P)), 

mgr(X,P,D), -ick_mgr(X,P,D)- 
+mgr(X,P,D) <- (-mgr(X,P,D)Vmgr"(X,P,D)), 

-.diff_mgr(X,D), -ick_mgr(X, P, D)- 
-mgr(X,P,D) <- (+mgr(X,P,D)Vmgr'(X,P,D)), 

->proj(P) -nckjngr(X,P,D)- 

proj(p)"- 

ck_mgr(X,P,D) <- +ck_mgr(X, P, D), — ck_mgr(X, P, D)- 
dif f _mgr(X, D) <- mgr(X', P, D), X' ^ X- 

The final program is obtained by replacing rules with disjunctions in their bodies 
by standard rules. For instance, the second rule defining the predicate +mgr(X, P, D) 
is replaced by the rules 

+mgr(X,P,D) <- mgr-(X,P,D), -idif f _mgr(X, D), -ick_mgr(X, P, D)- 
mgr-(X,P,D) <- -mgr(X,P,D)- 

mgr-(X,P,D) <- mgr"(X,P,D)- Q 

In the following we will consider only semantics which associate to any update 
program a set of stable models. Moreover, if the set of stable models is empty we 
reject the program whereas if it contains more than one element we select, non 
deterministically, one model. However, if the set of stable models is a singleton, we 
say that the semantics is deterministic whereas if all models agree on the database 
atoms we say that the semantics is confluent. It is obvious that deterministic se- 
mantics are also confluent (for instance, the well-founded semantics is an example 
of deterministic semantics). 

Consider an update program UP and a database D. Then, we denote with XS 
a generic stable model semantics and with SEMxs[WP, D] the model of st{UP)o 
under the XS semantics. Moreover, given a set of literals M, we denote with M the 
set of update literals in M. The following definition introduces the application of 
an update program UP to a database D. 

Definition 4 

Let UP = (S, AP) be an update program and let D be a database. Then, the 
application of UP to D under stable model semantics XS denoted UPxs(D), is 
defined as T XS (5(D)) where T xs = SEM xs [UP, D] . □ 
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The application of UP = (6, AP) to D is carried out in three steps: (i) first the 
set of update atoms which are derived from the standard version of UP applied 
to D is computed, (ii) next, 8 is applied to D and, finally, (hi) the set of derived 
update atoms is applied to the current database. Observe that in the application 
of an update program UP to a database D, we first apply the input update set S 
and next the updates derived from the active program AP. Thus, we give priority 
to the active program since it defines a sort of constraints which must be satisfied 
— updates in 6 could be nullified by the rules in the program. 

Observe that our semantics is based on durable changes, first proposed in ( |Zaniolo, 1995| 
Durable changes are updates which will belong to the final update set. Thus, given 
an update program UP = (5,AP) and a database D, the meaning of a negated 
update literal -> + A appearing in AP, is that the atom A is not inserted in the 
database by the application of UP to D. This, implies that rules are only triggered 
by updates which will belong to the final update set. 

Proposition 1 

Let UP be an update program and let D be a database. Then, SEM xs[WP, D] is 
consistent for every stable model semantics XS. 

Proof The proof derives directly from the rewriting of programs. Let M be the 
unique model for st{UP)o under semantics XS. Each update atom +a(t) is in M 
only if -icfc_a(i) £ M~ . Moreover, if +a(t) is true and cfc_a(i) is false w.r.t. M, 
then —a(t) must be false. □ 

Corollary 1 

Let UP be an update program and let D be a database. Then, for each stable model 
semantics XS, SEMxs[WP,D] is conflict-free. 

Proof (Sketch). Straightforward. □ 

We next analyze the impact of several deterministic and non-deterministic se- 
mantics w.r.t. the properties introduced in Section 3 and analyze their complexity 
and expressivity. In the next subsection we analyze confluent semantics whereas in 
subsection 14 . 21 we analyze non-confluent semantics based on stable models. 



4-1 Confluent semantics 

Confluence is the property that given a database, the execution of active programs 
produces a new database independent of the order of execution of rules. The fol- 
lowing example shows that the procedural evaluation of active rules may give al- 
ternative results which are not all intuitive 

Example 7 

Assume a database consisting of the four emp, dept,mgr and prom where i) a tuple 
emp(e,d) means that employee e works for the department d, ii) a tuple dept(d) 
means that d is a department iii) a tuple mgr(e,d) means that e is a manager of 
department d, and iv) a tuple prom(e, d) means that employee e is a promoted 
manager of department d. Consider the following program. 
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ri : -emp(X,D') <- +prom(X,D), emp(X,D')- 
r 2 : +mgr(X,D) «- +prom(X,D)- 
r 3 : -dept(D) «- -emp(X.D), -.dif f (E, D)- 
r 4 : -emp(X,D) <— -dept(D), emp(X,D)- 
r 5 : -mgr(X,D) «- -dept(D), mgr(X,D)- 

diff(X,D) <- emp(X',D),X' ^ X- 

diff(X,D) «- mgr(X',D),X' 7^ X- 

The first two rules say that if an employee X working for department D' is promoted 
to be manager of department D, then the tuple emp(X, D') must be deleted and a 
tuple mgr(X,D) must be inserted into the database. The third rule says that if an 
employee X working for the department D is deleted and there is no other employee 
working for D, then department D must be deleted as well. The last two active rules 
say that if a department D is deleted, all employees and managers of D must be 
deleted too. 

Assume now that the initial database is D = {dept(dl), emp(el, dl)} and that 
the input update set is 5 = {+prom(el, dl)}, i.e. the employee el is promoted to 
be a manager of department dl. The procedural evaluation of the active rules gives 
several outcomes depending on the sequence of rules evaluation. Three possible in- 
stances are the following: D\ = {dept(dl), mgr(el, dl), prom(el, dl)} (derived from 
the execution sequence (ri,r 2 ,r 3 ) D 2 = {mgr(el, dl), prom(el, dl)} and (derived 
from the execution sequence (r*i, r%, rg, r±, T2) and D3 = {prom(el, dl)} (derived 
from the execution sequence (ri, 7$, ri, rg, r^)). □ 

In general, deterministic semantics guarantees that the rewritten deductive pro- 
grams have a unique minimal model and, therefore, that the final state of the 
database is unique and well-defined. 

In this subsection we consider several confluent semantics for update programs. 
We first analyze the standard well-founded semantics, the max deterministic se- 
mantics. Then we consider the restrictions of well-founded and max-deterministic 
semantics to total transformations, i.e. to transformations which do not introduce 
undefined tuples into the database. Finally, we analyze the unique stable model 
semantics. 



4-1.1 Well-founded semantics 

We consider here the well-founded semantics and compare it with the semantics for 
update programs proposed by Bidoit and Maabout pfidoit &: Maabout, 1 997). The 
two semantics differ in their rewriting of the source program and in the application 
of the input updates to the source database. 

We first recall how programs are rewritten in ( |Bidoit fc Maabout, 1997] ) . Given 
an update program UP denote with st\BM](MP) the standard DATALOG" program 
obtained from UP by (1) replacing each rule of the form +^4 <— Body (resp., —A <— 
Body) in AP with a rule +A <— Body,^ — A (resp., —A <— Body,^ + A) and 
(2) inserting, for each update atoms +A 6 6 (resp. — A G 5), a rule of the form 
+A 4- -1 - A (resp., -A <— -> + A). 

The following example shows the different behavior of the two rewritings. 
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Example 8 

Let us consider the update program of Example 2 UP = ({+conf irm(x, d)}, AP) 
where AP consists of the rule 

+mgr(X,D) <- +conf irm(X,D)- 

-mgr(X,D) <- mgr(X,D), -. + mgr(X,D), +conf irm(Y,D)- 
The rewritten program st(UP) consists of the rules 

+mgr(X,D) <- conf irm+(X,D), -ick_mgr(X, D)- 

— mgr(X.D) <— mgr(X,D), -imgr + (x, d), conf irm + (Y, D), -ick_mgr(x, d)- 
conf irm'(x, d)- 

ckjngr(X,D) <- +mgr(X,D), -mgr(X,D)- 

conf irm+(X,D) <- +conf irm(X, D)- 

conf irm+(X,D) <— conf irm'(X, D)- 

mgr+(X,D) <- +mgr(X,D)- 

mgr+(X,D) 4- mgr'(X,D)- 

where the fact conf irm'(x, d) takes into account the content of 5. The standard 
program st[BM]{WP) contains the rules 

+mgr(X,D) «- +conf irm(X,D), - mgr(X,D)- 

-mgr(X,D) <- mgr(X,D), -. + mgr(X, D), +conf irm(Y, D)- 

+conf irm(x, d) «— -i — conf irm(x, d)- 

Assuming that the source database is empty, the well founded model of st{UP)o 
is total and contains as positive atoms conf irm'(x, d), conf irm + (x, d), +mgr(x, d) 
and mgr + (x, d) whereas the well founded model of st^ BM ^(UP) o contains as defined 
literals only +conf irm(x, d) and -i — conf irm(x, d) 6 whereas all others literals arc 
undefined. Thus, our semantics correctly concludes that mgr(x, d) is inserted into the 
database (as true fact) whereas the semantics proposed in < |Bidoit fc Maabout, 1997| ) 
cannot decide about the insertion of mgr(x, d) (it inserts the tuple mgr(x, d) as un- 
defined fact). □ 

Now let UPws[bm]{D) be the application of UP to D under the well-founded 
semantics as proposed in | |Bidoit fc Maabout, 1997] ), we have the following result: 

Theorem 1 

UPws[bm](D) ±UP WS {D). 

Proof. See Appendix. □ 

The main difference between the technique proposed in ( |Bidoit fc Maabout, 1997] ) 
and the technique proposed here is in the rewriting of atoms of the input up- 
date set. Indeed, given an update program (6,AP), the technique proposed in 
( |Bidoit fc Maabout, 1997] ) does not distinguish between atoms in the input up- 
date set S and the rules in the active program AP (atoms in S are considered as 



6 — conf irm(x, d) is false in all stable models because there is no rule defining it. 
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facts of AP) whereas our technique considers the atoms in the input update set S as 
updates performed over the input database which (subsequently) trigger the rules 
of the active program AP. Moreover, the atoms which are undefined under the se- 
mantics proposed in ( |Bidoit fc Maabout, 1997| ) which are not undefined under our 
semantics are atoms contained in the input update set S and atoms which depend 
on them. As a consequence, the rewriting proposed here is more intuitive and also 
leaves fewer atoms undefined. 



4.. 1.2 Max- deterministic semantics 

The well-founded semantics is not able to catch the intuitive meaning of all pro- 
grams. We show this by means of an example. 

Example 9 

Consider a database D and the update program (5, AP) where 5 = {+new(a)} and 
AP consists of the rules 

+emp(X) <- -1 + mgr(X), +new(X)- 
+mgr(X) <- -1 + emp(X), +new(X)- 
+worker(X) <— -1 + noworker(X), -|-new(X)- 
+noworker(X) <— -1 + worker (X), +new(X)- 
+worker(X) <— +emp(X)- 
+worker(X) <- +mgr(X)- 

The rewritten program, where rules with body atoms not defined by any rule 
have been omitted, since they cannot be used to derive any fact, is 

+emp(X) «- -n + mgr(X), new'(X), -ick_emp(X)- 

+mgr(X) <- -. + emp(X), new'(X), -.ck_mgr(X)- 
new'(X)- 

+ worker(X) <— -1 + noworker(X), new'(X), -ick_worker(X)- 

+noworker(X) «— -1 + worker(X), new'(X), -ick_noworker(X)- 

+worker(X) <— +emp(X), -ick_woker(X)- 

+worker(X) <— +mgr(X), -ick_worker(X)- 

where the definitions of the predicates ck_emp, ck_mgr, ck_worker and ck_noworker 
have been omitted. 

The application of the well-founded semantics gives a model containing as de- 
fined literals only new'(a) and all other derived atoms are undefined. However, the 
intuitive meaning of the first two rules is that if new(a) is inserted either emp(a) 
or mgr(a) should be inserted too whereas the last two rules say that worker(a) 
must be inserted if either emp(a) or mgr(a) is inserted. The third and fourth rules 
say that if new(a) is inserted either worker(a) or noworker(a) must be inserted. 
Therefore, it seems more intuitive to deduce that worker(a) should be inserted into 
the database independently on the particular insertion of emp(a) and mgr(a). This 
'intuitive' meaning is captured by the max-deterministic semantics which gives, for 
the rewritten program, a model where the atom +worker(a) is true, the atoms 
+emp(a) and +mgr(a) are undefined and the atom +noworker(a) is false. □ 
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In the above example we have seen a program where the max-deterministic model 
is more informative than the well-founded model. The next proposition shows that 
this is generally true. 

Proposition 2 
UPws{D)<UPmd{D). 

Proof. Let CP = stiUP) and let Mi and M 2 be, respectively, the well-founded 
model and the max-deterministic model of CPd- Let D' be the database derived 
from the application of 5 to D. Since M 2 C M\, from Definition 4 we derive that, 
D 2 Q Di, where D 2 — M 2 (D') and D\ = M\(D') are, respectively, the databases 
derived under the max deterministic and the well founded semantics. □ 

We will see in Section 6 that the advantage of having more informative semantics 
is counter balanced by the computational complexity. 

4-. 1.3 Total deterministic semantics 

In this section we consider a total deterministic semantics for update programs. The 
idea is to consider as 'correct' only update programs which generate total database 
and to reject the programs which introduce unknown facts into the database. 

Definition 5 

Let D\ be a total database and let UP be an update program. Then, the application 
of UP to D\ under total well-founded semantics (resp. max-deterministic semantics), 
denoted UPtws(Di) (resp. UPtmd{D\)), is equal to D 2 = UPws(Di) (resp., D 2 = 
UPmd(Di) ) if D 2 is total, otherwise it is equal to D\. □ 

Observe that the condition of totality of the semantics is on the mapping and not 
on the model of the rewritten program. We next introduce necessary and sufficient 
conditions which guarantee that the mapping UP applied to a total database gives 
a total database. 

Proposition 3 

Let UP be an update program and let D be a total database. Let M — SEMxs [MP, D] ■ 
Then, UP(D) is total iff at least one of the following conditions is true 

1. M is total, or 

2. for each atom +p e M it must be the case that p G D + , and for each atom 
— p E M it must be the case that p E D~ . 

Proof. Let UP = (S,AP) and let D\ = 5(D). Clearly Di is total since 5 is conflict 
free. 

1. If M is total then M(D\) is also total since M is conflict free. 

2. From Definition 3 we derive that an atom p £ D±~ (resp. p G D^) is undefined 
in M(D\) only if — p (resp. +p) is undefined in M. But M C M and, therefore, 
M(Di) is also total. □ 
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Example 10 

Consider the update program of Example [5] where the update atoms +emp(X) 
and +mgr(X) are replaced, respectively, by the standard atoms emp(X) and mgr(X) 
(clearly, the update literals -> + emp(X) and -> + mgr(X) are replaced, respectively, 
by the literals -iemp(X) and -imgr(X)). 

Assuming the input database D, the application of UP to D under total well- 
founded semantics does not modify the database D since the the well-founded model 
of st(UP)o contains undefined update atoms, whereas the application of UP to D 
under total max-deterministic semantics produces a new database also containing 
the atoms new(a) and worker (a). □ 

Observe that the application of the program of Example 3 is not total under the 
well-founded semantics whereas it is total under the max-deterministic semantics. 



4-1-4 Unique total stable model semantics 

We now consider a different semantics which is not based on deterministic models. 
A Datalog program may have zero, one or more total stable models associated to it. 
A possible declarative confluent semantics for a program can be given by the unique 
total stable model semantics. Thus, we give semantics only to programs which have 
a unique total stable model (programs with zero or more than one total stable 
models are rejected). It is obvious that this semantics, as well as the previous ones, 
guarantees termination and confluence. The unique total stable model semantics 
will be denoted bt UTS. 

Definition 6 

Let D\ be a total database and let UP be an update program. Then, the application 
of UP to Di under unique total stable model semantics, denoted UPuts iP\), is equal 
to D2 = M(S(Di)) if M is the unique total stable model of st(UP)D 1 , otherwise it 
is equal to D\. □ 

Example 11 

Consider a database D and the update program {5, AP) where S = {+new(a)} and 
AP consists of the rules 

+emp(X) <- -. +mgr(X), +new(X)- 

+mgr(X) «- ^ + emp(X), +new(X)- 

+worker(X) «— +new(X)- 

— worker (X) «— +mgr(X)- 

Assuming that the input database D is empty, the program st{UP)o has a unique 
total stable model containing the literals +new(a), +emp(a), -i+mgr(a), +worker(a), 
-1 — worker (a). □ 

4-2 Nonconfluent stable model semantics 

We consider now nonconfluent semantics based on stable models. The presence 
of more than one stable model does not assure confluence, but in some cases, it 
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captures in a better way the intuitive behavior of active programs. Let us now 
clarify this by means of an example where all the previously defined semantics fail 
to handle the update requests returning an incomplete database or rejecting the 
requests that do not match the intuitive meaning of the program. 

Example 12 

Consider a database D and the update program (5,AP) where <5 = {+new(a)} and 
AP consists of the rules 

+emp(X) <- -. +mgr(X), +new(X)- 

+mgr(X) <- -i + emp(X), +new(X)- 

+worker(X) «— +emp(X)- 

+worker(X) <- +mgr(X)- 

The first two rules state that if new(a) is inserted into the database either emp(a) 
or mgr(a) is inserted too. However, in both cases the atom worker(a) is inserted 
into the database. Well-founded and max-deterministic semantics do not capture 
this behavior and they conclude that the atoms +emp(a), +mgr(a) and worker(a) 
are undefined in the new database. □ 

To overcome the limits of deterministic semantics we next consider the non- 
detcrministic version of stable model semantics. Non deterministic semantics per- 
mits us to have databases without undefined elements but this implies that we 
renounce the confluence property. 



4-2.1 Total stable model semantics 

Definition 7 

Total stable model semantics. 

Let D\ be a total database, UP be an update program and let S be the set of 
total stable models for st(UP) o x ■ Then, the application of UP to D\ under the non- 
deterministic version of total stable model semantics, denoted UPrsiPx), is equal to 
(i) D2 = M(5(Di)) if S is not empty and M G S is a (non deterministically selected) 
model in S, or (ii) D\ if S is empty (e.g. there are no total stable model). □ 

The problem with this semantics is that in some cases there are programs which 
do not admit any total stable model, but could admit total transformations under 
partial stable model semantics (recall that programs admit at least one partial 
stable model) . Although the computation of this semantics is expensive (see below) , 
it guarantees that the transformations are total. 



4-2.2 Maximal stable model semantics 

To overcome the above mentioned problem, it is possible to consider restricted 
languages which guarantee the existence of at least one total stable model, com- 
putable in polynomial time dGreco et al, 19951 ). A different approach can be defined 
by selecting, nondeterministically, a meaningful stable model. We consider maximal 
stable models which are the natural extension of the total stable model semantics 
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to partial interpretations. The main difference between total and maximal stable 
models semantics is that the first guarantees totality of the semantics whereas the 
former guarantees the existence of a model for all programs. 

Definition 8 

Maximal stable model semantics. 

Let D\ be a total database, UP be an update program and let S be the set of 
maximal stable models for st(UP)o 1 - Then, the application of UP to D\ under the 
non-deterministic version of maximal stable model semantics, denoted MP ms{D\) 
is equal to D2 = M(5(D\)) where M £ S is a (non deterministically selected) model 
in S. □ 

This semantics assure that the update transformation can be computed efficiently 
(see below), but it does not guarantee total transformations and, therefore, total 
databases. That is, since the maximal model M of st(UP)D 1 is chosen in a casual 
way, M may contain undefined update atoms that when applied to D\ return an 
incomplete database D%. 

A further refinement could be obtained by restricting the selection to models 
which generate total transformations on the database. 

Definition 9 

Maximal stable model semantics with total transformations. 

Let Di be a total database, UP be an update program and let S be the set of 
maximal stable models for st(UP)D 1 - Then, the application of UP to D\ under the 
non-deterministic version of maximal stable model semantics with total transfor- 
mations, denoted MPms tt (Di), is equal to (i) D 2 = M(6(Di)), if S is not empty, 
M 6 S is a non deterministically selected model in S such that D2 is total, or (ii) 
D\, if S does not contain any model M such that M(S(D\) is total. □ 

We conclude by noting that the last semantics assures that the selection of the 
stable model is restricted to those which guarantee that the updated database is 
total. We will see in Section 5 that the selection of a model which guarantees 
the totality of the transformation has high computational complexity whereas, by 
considering also partial transformations, the complexity of the (nondeterministic) 
selection is polynomial. 

5 Complexity 

As shown in ( |Abiteboul et al., 1994| most of the operational semantics proposed in 
the literature have very high complexity and expressivity. The semantics presented 
in this paper have low complexity and they are all located under the second layer 
of the polynomial hierarchy. It seems to us that semantics with low complexity are 
more desirable for database applications. 

Classical complexity theory classifies languages on the basis of the difficulty of de- 
ciding whether a given input string belongs to the language. I/O (update) programs 
have been classified in a similar fashion, defining a recognition problem associated 
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to them. However, another equivalent way to define the computational complexity 
is based on the complexity of constructing the result. 

Let UP be an update program and let D be a database on a fixed schema S. 
Then, we say that UP, under semantics XS has complexity C , if the complexity 
of constructing the result for UPxs{D) is C. The complexity of computing, for a 
given update program UP and a database D, UP{D) depends on the underlying 
semantics, since different semantics have different complexity. 

We recall here how basic complexity classes are defined. V and MV denote the 
classes of decision problems computable in polynomial time, respectively, by a de- 
terministic Turing machine and by a nondetcrministic Turing machine. coMV is 
the class of decision problems whose complementary problems are in MV and T> p 
is the class of decision problems which can be expressed by means of two distinct 
problems, one in MP and the second in coMV . US is the class of decision prob- 
lems with unique solution 7 (Blass fc Gurevich, 1982] ), whereas A?, is the class of 
decision problems computable in polynomial time by a deterministic Turing ma- 
chine which uses an AfV oracle. It is known that V C MV C V p C A?, and that 
V C coAtP C US C T> p C A?,. Whenever we are interested in finding solutions to 
problems, we consider complexity classes for search problems (also called function 
problems). Search and decision problems are closely related and decision problems 
can be used to state negative complexity results for search problems. (We refer 
the reader to (Papad imitriou, 1 994) for the formal definition of the relationship 
between decision and search problems.) Here we consider the class FMV , FUS and 
FA?, which are the classes of search problems whose corresponding decision prob- 
lems belong, respectively, to the classes AfP, US and A?, and the class FV which 
denotes the class of search problems in FAfP that can be computed in polynomial 
time. 

Proposition 4 

Let UP = (5, AP) be an update program and let D be a database on a fixed schema 
S. Then, the complexity of computing UP(D) is 

1. FV , under WS and non-deterministic A4S semantics; 

2. FAfV , under non-deterministic TS and non-deterministic AAStt semantics; 

3. £> p -hard and in FA?,, under max-deterministic semantics; 

4. F-US, under unique total stable model semantics. 

Proof. First of all observe that the application of 6 to D can be done in polynomial 
time as well as the rewriting of UP into st(UP). Moreover, since the cardinality of 
every minimal model M for st(UP)r> is polynomial in the size of D, the application 
of M to D' = 5(D) is computable in polynomial time. Therefore we have to consider 
the complexity of computing the model M of the Datalog" program st(UP) d under 
the various semantics. 

7 Formally, US is the class of languages accepted by a nondeterministic polynomial time bounded 
Turing machine with the convention that a string is accepted if there is exactly one accepting 
computation. 
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1. For a given program CP and a database D, both the well-founded model 
and any maximal stable model can be computed in polynomial time; thus 
the complexity of computing UP(D) under both the well-founded and the 
non-deterministic version of maximal stable model semantics is FP. 

2. The problem of checking if a Datalog^ program CP has a total stable model is 
A/^P-complete; clearly the problem of finding a stable model for st{UP) d is in 
FJ\fP; therefore, the complexity of computing UP(D) under non-deterministic 
total stable model semantics is in FMP . 

Consider now the MStt semantics. Checking if a Datalog program has a 
maximal stable model M which is total w.r.t. a subset of its predicates is also 
A/^P-complete. Therefore, the complexity of computing UP(D) under non- 
deterministic A4Stt semantics is also in FMP . 

3. In ( |Sacca fc Zaniolo, 1997| ) it has been shown that computing the max-deterministic 
model is T> p hard and it is in FA^; therefore computing UP(D) under the 
max-deterministic semantics is 2? p -hard and it is in F-A^. 

4. The problem of checking if a Datalog^ program has a unique total stable 
model is in US. Therefore, the complexity of computing UP(D) under unique 
total stable model semantics is in F US. □ 

The complexity of computing update programs under a given semantics is counter 
balanced by its expressivity. Indeed, update programs are functions on databases 
and, as shown above, the well-founded semantics only permits the expression of 
a subset of transformations corresponding to polynomial problems. Total stable 
model semantics and maximal stable model semantics permit the expression of a 
larger set of transformations but they have higher complexity. The choice of the 
right semantics depends on the class of problems which our transformations want 
express. 

6 Discussion and Related Work 

Active and deductive rules can be seen as opposite ends of a spectrum of database 
rule languages ( |Widom, 1993| ). Deductive rules provide high-level declarative lan- 
guage for specifying intensional relations. In contrast, active rules are more low-level 
and often need explicit control on rule execution. Most commercial active rule sys- 
tems operate at a relatively low-level of abstraction and are heavily influenced by 
implementation-dependent procedural features. The procedural control of active 
rules makes it very difficult to understand or predict their meaning. This difficulty 
increases as rules are added or deleted and the meaning of rules is not understood 
by looking at their specifications. For these reasons, active rules so far have been 
approached with a great deal of care ( |Widom fc Ceri, 1996| ). Several tools have 
been provided to understand the run-time behaviour of active rules. These tools 
have been developed for compile-time rule analysis to understand if rule process- 
ing terminates and to know if the execution of a set of rules which are not fully 
prioritized always gives a unique outcome. 

Several techniques, based on syntactic and semantics analysis of rules, have been 
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proposed to detect termination. Syntactic analysis is based on the construction of 
a triggering graph ( |Aiken et al, 1994| ) . The nodes of the graph are rules whereas 
there is an arc from node Ti to node Tj if the execution of rule r\ can trigger 
rule Tj. If there is a cycle in the graph, the rules may trigger each other indef- 
initely and the termination property is not guaranteed. This idea was refined in 
( |Baralis fc Widom, 1994| ) where a different graph, called activation graph, was pro- 
posed and in ( |Baralis et al., 1996| ) where a combination of triggering and activation 
graphs was proposed. A different technique based on abstract interpretation has re- 
cently been proposed in ( |Bailey et al, 1997| ). It is worth noting that all the proposed 
techniques can check termination for restricted cases, i.e. they are able to determine 
possible cases of nontermination but they are not able to determine termination 
in the general case. However, generally, checking whether an arbitrary set of rules 
terminates is an undecidable problem. The problem remains undecidable also for 
rules which do not use arithmetics and function symbols. 

Confluence is guaranteed if rules are prioritized, i.e. if there is a linear order 
on priorities associated with rules, so that only one rule at time can be activated. 
Moreover, confluence can also be guaranteed in cases where more than one rule can 
be activated at the same time. Confluence tools are based on the commutativity of 
rules execution. Two rules and r, commute if, for each database D, the activation 
of Ti followed by Tj and the activation of rj followed by ri, produce the same 
database. A basic technique for checking confluence is based on the checking of 
commutativity for each pair of rules in the rule set. Less restrictive conditions have 
been defined in the literature. These are based on finding minimal sets of rules which 
must be commutative (Aik en et al., 199~4| ). However, generally, checking confluence 
for an arbitrary set of rules is undecidable. 

The analysis of rules can be further complicated by aspects such as immedi- 
ate versus deferred triggering, the use of delta relations and composite events. 
Although there has been considerable research and development in the area of 
active databases, there has been very little activity in the study of formal foun- 
dations. Some preliminary work, based on the rewriting of active rules in terms 
of deductive rules, can be found in (Bar al fc Lobo, 19 96 Zaniolo, 1993), whereas 
in l ] Widom, 1992| ) the semantics of active rules is given in terms of denotational 
semantics. The combination of deductive and active rules has been also investi- 
gated ( |Lausen et al, 1998| |Ludascher, 1998| |Zaniolo, 1993| ). All these approaches 
are based on the simulation of active rules by means of deductive rules, where 
atoms have associated a temporal argument (also called state argument). 

The approach proposed in this paper is based on the definition of a declarative 
semantics for active (and deductive) rules. Our semantics guarantees i) termina- 
tion, for programs without function symbols and ii) confluence, under a confluent 
semantics. 

In this paper we have not investigated the presence of arithmetical operators and 
function symbols. However, since the semantics we have consider apply to general 
logic programs, confluent semantics guarantee confluence in the general case. 

The presence of arithmetical operators and function symbols does not permit to 
guarantee termination since the Herbrand base is infinite. A large amount of work 
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has been devoted to the termination of query evaluation in logic programming and 
deductive databases ( |De Schreye fc Decorte, 1 994 Kif er et al, 1988||Ramakrishnan et al, 1987| 
Krishba murty et al, 19 88 Sa giv fc Vardi, 1989| ). Distinction is made between the 
problem of proving termination under top-down evaluation and proving that bottom- 
up evaluation generates finite answers. The latter property is known as safety. For 
termination under top-down evaluation we refer to the survey of De Schreye and 
Decorte QDe Schreye fe Decorte, 1994| ). The safety property for queries with func- 
tion symbols, under bottom-up evaluation, has been analyzed in ( |Kifer et al, 1988| 
|Ramakrishnan et al, 1987| |Krishbamurty et al, 1988||Sagiv fe Vardi, 1989| ). A re- 
lated problem is whether the evaluation strategy computes all answers and termi- 
nates, i.e. all intermediate relations are finite. This problem is also known as ter- 
mination or effective comput ability. Clearly, effective computability is a stronger 
property than safety, i.e. effective computability implies safety but the converse is 
not true. The problem of determining safety for queries with function symbols is, 
generally, undecidable. 

Several techniques for checking the effective computability of special classes of 
queries have been proposed in the literature. Most of the proposed techniques are 
based on the approximation of queries with function symbols by means of queries 
without function symbols over infinite relations. We mention here the restricted 
cases known as supersafety problem (i.e. the question of whether a query has a 
finite answer in any model which is a fixpoint of the operator T^p ( |Lloyd, 1987| )), 
which has been shown to be decidable and axiomatizable fKifer et al, 1988| ) and 
strong safety problem (i.e. the question of whether programs, where derived predi- 
cates used to compute other derived predicates are safe, are effectively computable), 
which has been shown to be decidable and efficiently computable. Another class of 
interest which has been studied is that of monadic queries, i.e. queries in which all 
intensional predicates are monadic. It has been shown that, for monadic queries ef- 
fective computability is decidable in polynomial time ( |Sagiv fc Vardi, 1989| ). Other 
techniques based on the identification of cycles performing monotonic operations 
with a finite bound have also been studied. 

7 Conclusions 

In this paper we have analyzed declarative semantics for active rules. The advan- 
tages of using declarative semantics are that termination is guaranteed for function 
free queries, confluence is also guaranteed by using confluent semantics and the 
semantics is well defined and intuitive. Furthermore, the framework proposed in 
this paper permits a natural integration of deductive and active rules and can also 
be applied to queries with function symbols or to queries over infinite databases. 
We have also shown that semantics which reduce the set of unknown literals have 
high complexity (and also high expressive power) . Further work should be devoted 
to investigating the relation between non-deterministic and procedural semantics; 
in both cases different computations of the same program on the same database 
could result in different outcomes. 
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Appendix 

In this appendix we recall the definition of operator W,cp used to introduce the 
well-founded semantics and report the proof of Theorem 1. 

Definition 10 

Well Founded model. Let CP be a Datalog program and let X C (BcpU^Bcp ) be an 
interpretation. The transformation Wcp(X) is equal to T cp(X) U^XJcp{X), where 
T cp is the immediate consequence operator and U cp is the greatest unfounded 
w.r.t. X as explained in Section 2. 

Furthermore, let a be a successor ordinal, we recursively define W^pi^) as X if 
o = and as W£p(W^ 1 (X)) otherwise. 

Any Gxpoint of this transformation represents a P-stable model of CP and par- 
ticularly the least Gxpoint of~Wcp is the well-founded model of CP. □ 

Theorem 1 UP WS [bm]{D) < UP WS {D) 

Proof. Let Pi = st [BM] (UP) D and P 2 = st(UP) D and let M 1 and M 2 be the 
well-founded models, respectively, of Pi and P 2 . We must show that M\(D) < 
M 2 (6(D)). Thus, it is sufficient to show that M x + C M 2 + UK and Mf C M 2 ~ where 

K = {+ P (t)\+ P (t)eSA^-p(t)eM 2 -}u{-p(t)\- P (t)e5A^ + P (t) e M 2 ~}, 

i.e., K is the set of updates in S which are not contradicted in M 2 . 
Table 1 shows the relationship between the programs Pi and P 2 . 



1 6 


AP 


| St[BM] 


(UP) 


| st{UP) | 




±p(t) <- Body- 


| ±p(t) <- Bo 


iy,^Tp(t)- 


±p(<) «— Bodyi , ->ck.p(t)- 




p(t) <— Body- 


| Pit) +- 


Body- 


| p(t) <- Bodyi- | 


1 +p(t) 




| +P(t) «- 


--P(*)- 


I p'(0 - ■ | 


1 -p(t) 




| -P(t) <" 


^+P(t)- 


I p"M - ■ | 



Table 1. Relations among rules of st[BM}(MP) and st(UP) 

Each row of the table shows how atoms in <5 (first column) and rules in the active 
program (second column) are rewritten by the technique proposed in ( |Bidoit fc Maabout, 1997| ) 
(third column) and by our technique (fourth column) . Observe that Body denotes a 
conjunction of literals (update and standard) whereas Bodyi is derived from Body 
but all update atoms are replaced by disjunction (see Section 4). Note the ground in- 
stantiations of the rewritten programs. Since auxiliary atoms are defined by facts, 
disjunctions in Bodyi are eliminated by replacing the auxiliary atoms with their 
truth value. Consequently, auxiliary facts can also be eliminated from ground(P 2 ) . 

Therefore, for each rule r 2 in ground(P 2 ) there is a rule r*i in ground(Pi) such 
that Bodyi C Body. 
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Now we can show the above containment by induction on the fixpoint operator 
W P . 

Step 0. h = h = 0. 
Step i > 0. 

Consider two interpretations l\ and 7 2 for P\ and P 2 , respectively, such that 
h C (I 2 U K h ). We must show that W(7i) C (W(/ 2 ) U K w(h) ). 

Consider, first, the containment Tp 1 (Ii) C Tp 2 (/ 2 ) U i^ W (/ 2 )- For eacn ground 
predicate a(t) G I\ there are two possibilities: 

1. [a(t) G" S-] This means that there is a rule of the form ±a(t) <— Body, -^^fa(t) 
(resp., a(t) <— Sod?/ if a(£) is not an update atom) in ground(Pi) and 
that (Body U -1 =F a (t)) C Moreover, there must be a rule ±a(t) <— 
Bodyi, ->ck_a(t) (resp., a(f) <— Bodyi if a(t) is not an update atom) in 
ground(P 2 ) and since (1) Bodyi C f?ocfa/ C (2) Bodyi does not contain 
input update literals and IjC^U if/ 2 we conclude that Bodyi ^ h ■ Further- 
more, ->ck_a(t) G I 2 if either -1 + a(i) or -1 — a{t) is in ii. 

2. [a(t) G <5-] If an update atom a(t) = +p(t) (resp., a(t) = —p(t)) belongs to 
Ii then also -* — p(i) (resp., — • + p(t)) belongs to it. But since Jf" C J 2 - , from 
the definition of Kj, we derive that a(t) G 7C/ 2 . 

Consider, now the relation Upj(/i) C Up 2 (/ 2 ). Let a(t) be a ground atom be- 
longing to Up,(/i); for each rule r 2 of ground(P 2 ) with i?(r 2 ) = a(i) there is a 
corresponding rule n in ground(P\) with #(ri) = We have that 

1. If (S(n) n ^ (the body of n is false w.r.t. /1), then (B(r 2 ) n -.J 2 ) ^ 
since the input update cannot be assumed false in I\. 

2. If (B(n) n Ui) = U n 7^ 0, then, every atom a(t) € U n cannot be an input 
update, otherwise it will be assumed false in Wp, (Ji), and, therefore, it must 
appear also in B(r 2 ). 

So we can assume that all the element of Up,(/i) must also belong to Up 2 (/ 2 ). 
Finally, ck_a(t) G Up 2 (J 2 ) if either +a(t) or —a(t) is in Up^/i). □ 



